Import and warehouse data

Data cleansing, Data analysis and visualisation

Data pre-processing, Model training, testing and tuning

The model with the highest K-fold cross-validation accuracy is Logistic Regression, with an accuracy of 79.84%.
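For readers unfamiliar with how a K-fold validation accuracy is obtained, the following is a minimal sketch using only the Python standard library. It is not the code used for this analysis: the helper names (`k_fold_indices`, `k_fold_accuracy`, `fit_majority`, `predict_majority`), the toy data and the majority-class baseline are all assumptions made for illustration. In practice, the `fit` and `predict` arguments would wrap a real model such as Logistic Regression.

```python
import random
from collections import Counter
from statistics import mean


def k_fold_indices(n_samples, k, seed=0):
    """Shuffle the row indices and split them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def k_fold_accuracy(X, y, fit, predict, k=5, seed=0):
    """Return the mean held-out accuracy (in percent) over k folds."""
    folds = k_fold_indices(len(X), k, seed)
    scores = []
    for held_out in folds:
        held_out_set = set(held_out)
        # Train on every row that is not in the held-out fold.
        train = [i for i in range(len(X)) if i not in held_out_set]
        model = fit([X[i] for i in train], [y[i] for i in train])
        # Score the model only on rows it has not seen.
        correct = sum(predict(model, X[i]) == y[i] for i in held_out)
        scores.append(100.0 * correct / len(held_out))
    return mean(scores)


# Hypothetical baseline: always predict the most common training label.
def fit_majority(X_train, y_train):
    return Counter(y_train).most_common(1)[0][0]


def predict_majority(model, row):
    return model


# Toy data (not the churn data set): 70 non-churners and 30 churners.
X = [[i] for i in range(100)]
y = [0] * 70 + [1] * 30
baseline_score = k_fold_accuracy(X, y, fit_majority, predict_majority, k=5)
print(f"Baseline 5-fold accuracy: {baseline_score:.2f}")
```

A majority-class baseline like this one gives a useful floor: a model's K-fold accuracy is only meaningful to the extent that it exceeds what is achieved by always predicting the most common class.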

Conclusion

The company can now use the data to predict customers' likes and dislikes. The bivariate and multivariate analyses give further insight, helping the company make decisions wisely and strategise its marketing value chain. This will enable executives to run effective customer retention programmes.

To gain more insight into churn for this data set, parameters such as internet speed and international calling should be included. I enjoyed working on this data set and found the exercise effective.